FLAME: A Fast Large-scale Almost Matching Exactly Approach to Causal Inference

نویسندگان

  • Sudeepa Roy
  • Cynthia Rudin
  • Alexander Volfovsky
  • Tianyu Wang
چکیده

A classical problem in causal inference is that of matching, where treatment units need to be matched to control units. Some of the main challenges in developing matching methods arise from the tension among (i) inclusion of as many covariates as possible in defining the matched groups, (ii) having matched groups with enough treated and control units for a valid estimate of Average Treatment Effect (ATE) in each group, and (iii) computing the matched pairs efficiently for large datasets. In this paper we propose a fast and novel method for approximate and exact matching in causal analysis called FLAME (Fast Large-scale Almost Matching Exactly). We define an optimization objective for match quality, which gives preferences to matching on covariates that can be useful for predicting the outcome while encouraging as many matches as possible. FLAME aims to optimize our match quality measure, leveraging techniques that are natural for query processing in the area of database management. We provide two implementations of FLAME using SQL queries and bit-vector techniques.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Enhanced Fast Causal Network Inference over Event Streams

This paper addresses causal inference and modeling over event streams where data have high throughput, are unbounded, and may arrive out of order. The availability of large amount of data with these characteristics presents several new challenges related to causal modeling, such as the need for fast causal inference operations while ensuring consistent and valid results. There is no existing wo...

متن کامل

Fast Least Square Matching

Least square matching (LSM) is one of the most accurate image matching methods in photogrammetry and remote sensing. The main disadvantage of the LSM is its high computational complexity due to large size of observation equations. To address this problem, in this paper a novel method, called fast least square matching (FLSM) is being presented. The main idea of the proposed FLSM is decreasing t...

متن کامل

LES modeling for lifted turbulent jet flames

The LES method is an attractive approach for the simulation of turbulent jet flames. In this method, the effects of large scale structures controlling the mixing process are resolved while small-scale effects such as the leading-edge flames involved in the flame base dynamics are accounted for by the subgrid-scale models. The LES approach is examined in this study with a particular emphasis on ...

متن کامل

Matching as Nonparametric Preprocessing for Reducing Model Dependence in Parametric Causal Inference

Although published works rarely include causal estimates from more than a few model specifications, authors usually choose the presented estimates from numerous trial runs readers never see. Given the often large variation in estimates across choices of control variables, functional forms, and other modeling assumptions, how can researchers ensure that the few estimates presented are accurate o...

متن کامل

The Statistics of Causal Inference: The View from Political Methodology

Many areas of political science focus on causal questions. Evidence from statistical analyses are often used to make the case for causal relationships. While statistical evidence can help establish causal relationships, it can also provide strong evidence of causality where none exists. In this essay, I provide an overview of the statistics of causal inference. Instead of focusing on statistica...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1707.06315  شماره 

صفحات  -

تاریخ انتشار 2017